A Comparison of Keyword- and Keyterm-based Methods for Automatic Web Site Summarization
Abstract
Automatic Web site summarization, which is based on keyword and key sentence extraction from narrative text, is an effective means of making the content of a Web site easily accessible to Web users. This work is directed towards summary generation based on multi-word terms extracted by the C-value/NC-value method. Keyterm-based summaries are compared with keyword-based summaries for a list of test Web sites. The evaluation indicates that keyterm-based summaries are significantly better than keyword-based summaries, which have previously been shown to be as informative as human-authored summaries.
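The abstract names the C-value/NC-value method without stating it. As a rough illustration only, the sketch below scores candidate multi-word terms with the commonly cited C-value formula: log2 of the term length times its frequency, discounted by the average frequency of the longer candidate terms that contain it. The function names, the tuple-of-words representation, and the frequency counts are assumptions made for this example, not the paper's implementation or data, and the NC-value context-weighting step is not covered.

```python
import math


def is_nested(short, long_term):
    """Return True if `short` occurs as a contiguous word sequence inside `long_term`."""
    n = len(short)
    return any(long_term[i:i + n] == short for i in range(len(long_term) - n + 1))


def c_value(freq):
    """Score candidate terms with the C-value formula.

    `freq` maps each candidate term (a tuple of words) to its corpus frequency.
    Single-word candidates score 0 because log2(1) == 0; the measure targets
    multi-word terms.
    """
    scores = {}
    for a in freq:
        # Longer candidate terms that contain `a` as a nested sub-term.
        containers = [b for b in freq if len(b) > len(a) and is_nested(a, b)]
        if not containers:
            scores[a] = math.log2(len(a)) * freq[a]
        else:
            # Discount by the average frequency of the containing terms.
            nested = sum(freq[b] for b in containers) / len(containers)
            scores[a] = math.log2(len(a)) * (freq[a] - nested)
    return scores


# Hypothetical frequency counts, chosen only to exercise the formula.
example_freq = {
    ("web", "site", "summarization"): 5,
    ("site", "summarization"): 7,
    ("web", "site"): 12,
}
example_scores = c_value(example_freq)
for term, score in sorted(example_scores.items(), key=lambda kv: -kv[1]):
    print(" ".join(term), round(score, 3))
```

In this toy input, "site summarization" is penalized because most of its occurrences are inside the longer term "web site summarization", which is the behaviour that lets the measure prefer complete multi-word terms over their fragments.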
Similar resources
A Comparison of Word- and Term-based Methods for Automatic Web Site Summarization
Automatic Web site summarization is an effective means of making the content of a web site easily accessible to Web users. We demonstrate that a content-based approach to summarization, which is based on keyword and key sentence extraction from narrative text, is able to generate summaries that are as informative as human authored summaries. This work is directed towards summary generation base...
Comparing Key Phrase Extraction Methods in Automatic Web Site Summarization
We benchmark five methods, TFIDF, KEA, Keyword, Keyterm, and Mixture, for key phrase extraction in the automatic Web site summarization task. We investigate the performance of these methods via a formal user study and demonstrate that Keyterm is the best method for extracting key phrases while Mixture is the best one for obtaining key sentences.
A Comparative Study on Key Phrase Extraction Methods in Automatic Web Site Summarization
Web Site Summarization is the process of automatically generating a concise and informative summary for a given Web site. It has gained more and more attention in recent years as effective summarization could lead to enhanced Web information retrieval systems such as searching for Web sites. Extraction-based approaches to Web site summarization rely on the extraction of the most significant sen...
A survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...